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Description 

Cross-Reference to Related Applications 

5 This application Includes reference to U.S. Patent No. 4,628,529. assigned to the same assignee as the present 

application. Furthermore, this application contains subject matter related to U.S. Patent No. 4,630,304 and U.S. Patent 
No. 4.630»305, also assigned to the same assignee as the present application. 

Background of the Invention 

10 

1. Field of the Invention 

The present invention relates generally to acoustic noise suppression systems. The present invention is more 
specifically directed to improving the speech quality of a noise suppression system employing the spectral subtraction 
IS noise suppression technique. 

2. Description of the Prior Art 

Acoustic noise suppression in a speech communication system generally serves the purpose of improving the 
20 overall quality of the desired audio signal by filtering environment background noise from the desired speech signal. 
This speech enhancement process is partk;ularly necessary in environments having abnornnally high levels of ambient 
background noise, such as an aircraft, a moving vehicle, or a noisy factory. 

The noise suppression technique described in the aforementioned patents is the spectral subtraction - or spectral • 
gain modification -technique. Using this approach, the audio input signal is divided into individual spectral bands by 
25 a bank of bandpass filters, and partrcular spectral bands are attenuated according to their noise energy content. A I 
spectral subtraction noise suppression prefilter utilizes an estimate of the background noise power spectral density to 
generate a signal-to-noise ratio (SNR) of the speech in each channel, which, in turn, is used to compute a gain factor 
for each individual channel. The gain factor is used as a pointer for a look-up table to determine the attenuation for 
that particular spectral band. The channels are then attenuated and recombined to produce the noise-suppressed 
30 output waveform. 

In specialized applications involving relatively high background noise environments, most noise suppression tech- 
niques exhibit significant performance limitations. One example of such an applicatbn is the vehicle speakerphone 
option to a cellular mobile radio telephone system, which provides hands-free operation for the automobile driver. The 
mobile hands-free microphone Is typically located at a greater distance from the user, such as being mounted overhead 

55 on the visor. The more distant microphone delivers a much poorer signal-to-noise ratio to the land-end party drive to 
road and wind noise conditions. Although the received speech at the land-end is usually intelligible, continuous expo- 
sure to such background noise levels often increases listener fatigue. 

Although most prior art techniques perfomn sufficiently well under nominal background noise conditions, the per- 
fomnance of known techniques becomes severely limited in such specialized applications of unusually high background 

40 noise. Typical spectral subtraction noise suppression systems may reduce the background noise level over the voice 
frequency spectrum by as much as 10 dB without seriously affecting the speech quality. However, when the prior art 
techniques are used in relatively high background noise environments requiring noise suppression levels approaching 
20 dB, there is a substantial degradation in the quality characteristics of the voice. Furthermore, in rapidly-changing 
high noise environments, a severe low frequency noise flutter develops in the output speech signal which resembles 

45 a distant "jet engine roar" sound. This noise flutter is inherent in as spectral subtraction noise suppression system, 
since the individual channel gain parameters are continuously being updated in response to the changing background 
noise environment. 

The background noise flutter problem was indirectly addressed but not eliminated through the use of gain smooth- 
ing. For example, R.J. McAulay and M.L Malpass, in the article entitled "Speech Enhancement Using a soft-Decision 

so Noise Suppression Filter", IEEE Trans. Amust.. Speech. Signal Processing. Vol. ASSP-28, No.2 (April 1980), pp. 
137-145, propose the use of gain smoothing on a per-frame basis to avoid the introduction of discontinuities in the 
output waveform. Since the introduction of gain smoothing can cause the noise suppression prefilter to be slow to 
respond to a leading edge transition (which would result in speech distortion), a weighting a factor of 1 or 1/2 was 
chosen such that the prefilter responds immediately to an increase in gain while tending to snrrcoth any decrease in 

55 gain. Unfortunately, excessive gain snrxjothing still produces noticeable detrimental effects in voice quality, the primary 
effect being the apparent introduction of a tail-end echo or "noise pump" to spoken words. There is also a significant 
reduction in voice amplitude with large amounts of gain smoothing. 

The noise flutter performance was further improved by the technique of smoothing the noise suppression gain 
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factors for each individual channel on a per-sannple basis instead of on a per-frame basis. Per-sample smooth ing, as 
well as utilizing different smoothing coefficients for each channel, is described in U.S. Patent No, 4,630,305. entitles 
"Automatic Gain Selector for a Noise Suppression System." However, none of the known prior art techniques appreciate 
that the primary source of the channel gain discontinuities is the inherent fluctuation of background noise in each 

s channel from one frame to the next. In known spectral subtraction systems, even a 2 dB SNR variation would create 
a few dB of gain variation, which is then heard as an annoying background noise flutter. Hence, the flutter problem has 
never been effectively solved. 

Moreover, narrowband noise - that which has a high power spectral density in only a few channels - further 
complicates the background noise flutter problem. Since these few high energy noise channels would not be attenuated 

10 by the background noise suppression, the resultant audio output has a "running water" type of characteristic. Narrow- 
band noise bursts also degrade the accuracy of the background noise update decision required to perform noise sup- 
pression in changing background noise environments. 

Since the gain factors are chosen by SNR estimates, which are determined by the speech energy in each channel 
(signal) and the current background noise energy estimate in each channel (noise), the performance of the entire noise 

16 suppression system is based upon the accuracy of the background noise estimate. The statistk^s of the background 
noise are estimated during the time when only background noise is present, such as during the pause in human speech. 
Therefore, an accurate speech/noise clarification must be made to determine when such pauses in speech are occur- 
ring. 

It is widely known that the energy histogram technique for distinguishing between background noise and speech 
20 perform sufficiently well in normal ambient noise environments. See, e.g., W.J. Hess, "A Pitch Synchronous Digital 
Feature Extraction system for Phonemic Recognition of Speech." IEEE Trans. Acoust.. Speech. Signal Processing. 
Vol. ASSP-24, No. 1 (February 1976), pp. 14-25. Energy histograms of acoustk; signals exhibit a bimodal distribution 
in which the two modes correspond to noise and speech. Thus, an appropriate threshokJ can be set between the two 
modes to provide the speech/noise classification. However, the distinction between background noise energy and 
25 unvoiced speech energy in relatively high background noise environments is unclear. Consequently, the task of accu- 
rately finding the two modes of the energy histogram, and setting the appropriate threshokJ between them , is extremely 
difficult. 

To accomnrxxJate changing noise backgrounds, McAulay and Malpass implement and adaptive threshold by con- 
stantly nx>nitoring the histogram energy on a frame-by-frame basts, and updating the threshold utilizing different decay 

30 factors. Alternatively, U.S. Patent No. 4,630,304 utilizes an energy valley detector to perfomn the speech/noise decision 
based upon the post -processed signal energy ~ signal energy available at the output of the noise suppression system 
~ to determine the detected speech minimum. Thus, the accuracy of the background noise estimate is improved since 
it is based upon a much cleaner speech signal. 

However, neither prior art technkijue is properly responsive to a sudden, strong increase in background noise level. 

35 These background noise estimate updating decision processes interpret a sudden, loud noise level rise as speech, 
such that no updates are performed. The energy histogram or valley detector has a slow adaptation characteristic 
which will eventually adapt to the higher noise level. However, this adaptation characteristic does lead to incorrect 
noise updates on the weaker energy portions of speech. This erroneous decisbn significantly degrades the perform- 
ance of the noise suppressbn system. 

40 A need, therefore exists for an improved acoustic noise suppression system which addresses the problems of 

background noise fluctuations, narrowband noise bursts, and sudden background noise. 

Summary of the Inventbn 

4$ According to a first aspect of the present invention there Is provided a noise suppression system for attenuating 

the background noise from a noisy input signal to produce a noise-suppressed output signal, said noise suppression 
system comprising: means for separating the input signal into a plurality of pre-processed signals representative of 
selected frequency channels; means for generating estimates of the signal-plus-noise energy and the noise energy in 
each individual channel; and means for producing a gain value for each individual channel in response to said channel 

50 energy estimates; the system characterized in that: said gain values have a minimum gain value for each channel; 
said means for producing a gain value includes threshold means for allowing gain values above said minimum gain 
value to be produced only when said signal-plus-noise energy estimates exceed said noise energy estimates by a 
predetermined anrount; and means for modifying the gain of each of said plurality of pre-processed signals in response 
to said gain values to provide a plurality of post-processed signals. 

55 In another aspect of the present invention there is provided a method of attenuating the background noise from a 

noisy input signal to produce a noise-suppressed output signal in a noise suppression system comprising the steps 
of: separating the input signal into a plurality of pre-processed signals representative of a number N of selected fre- 
quency channels; generating an estimate of the energy in each individual channel; generating and storing an estimate 



3 

03/28/2003, EAST Version: 1.03.0002 



EP 0 380 563 B1 

of the background noise power spectral density of said pre-processed signals; and generating an estimate of the signal- 
to-noise ratio (SNR) in each individual channel based upon said background noise estimates and said channel energy 
estimates; the method characterized by the steps of: producing a gain value for each individual channel in response 
to said channel SNR estimates, wherein said gain values have a range of minimal values; and wherein said gain values 

s producing step includes the steps of: providing a predefined SNR threshold and comparing (said channel SNR esti- 
mates to said predefined SNR threshold such that channels having SNR estimates below said SNR threshold produce 
gain values within said minimal range; and modifying the gain of each of said plurality of pre-processed signals in 
response to said gain values to provide a plurality of post -processed signals. 

The present invention advantageously provides an improved method and apparatus for suppressing background 

10 noise in high background noise environments without significantly degrading voice quality. Moreover, the present in- 
vention addresses the problem of background noise fluctuation without requiring large amounts of gain smoothing. 
Additionally, a preferred embodiment of the present invention provides a spectral subtraction noise suppression system 
that compensates tor the detrimental effects of narrowband noise bursts. Furthermore, the present invention provides 
an improved background noise estimatbn mechanism that is not misled by low energy portions of speech, yet still 

75 provides correction for sudden, strong increases in background noise levels. 

In the present invention, the improvements to the noise suppression system relate to the addition of an SNR 
threshold mechanism to eliminate minor gain fluctuations for low SNR conditions, a voice metric calculator for producing 
a more accurate background noise estimate update decision and a channel SNR modifier to suppress narrowband 
noise bursts. 

20 More specifically, the first aspect of the present invention pertains to the addition of an SNR threshold mechanism 

for providing a predetermined SNR threshold which the channel SNR estimates must exceed before a gain value above 
a predefined minimum gain value can be produced, in a preferred embodiment, the SNR threshold is set at 2.25 dB 
SNR, such that minor background noise fluctuations do not create step discontinuities in the noise suppression gains. 
In a preferred embodiment, the a voice metric calculator is utilized to perform the speechAioise classification for 

25 the background noise update decision using a two-step process. First, the raw SNR estimates are used to index a 
voice metric table to obtain voice metric values for each channel. A voce metric is a measurement of the overall voice- 
like characteristics of all of the channel energies. The individual voice channel metric values are summed to create a , 
first multi-channel energy parameter, and then compared to a background noise update threshold. If the voice metric 
sum does not meet the threshold, the input frame in deemed to be noise, and a background noise update Is performed. 

30 Secondly, the time since the occurrence of the previous background estimate update is constantly monitored. If too 
much time has passed since the last update, e.g. 1 second, then it is assumed that a substantial increase in noise has 
occurred, and a background noise update is performed regardless of whether it looks like a voice frame. This second 
test is based on the assumptnn that speech seldom contains continuous high energy levels in all channels for more 
than one second, which would be the case for a sudden, loud noise level increase. The voice metric algorithm incor- 

35 porating the two-step decisbn process provides a very accurate background noise estimate update signal. 

In another preferred embodiment, a channel SNR modifying mechanism provides a second multi-channel energy 
parameter in response to the number of upper-channel SNR estimates which exceed a predetermined energy threshold, 
e.g. 6 dB SNR. If only a few channels have an energy level above this threshold (such as would be the cased for a 
narrowband noise burst), the measured SNR for those particular channels would be reduced. Moreover, if the afore- 

40 mentioned voice metric sum is less than a metric threshold (which would indicate that the frame was noise), all channels 
are similarly reduced. This SNR modifying technique is based on the assumption that typical speech exhibits a majority 
of channels having signal-to-noise ratios of 6 dB or greater. 

An exemplary embodiment of the present invention will now be described with reference to the accompanying 
drawings. 

45 

Brief Description of the Drawings 

Figure 1 is a detailed block diagram illustrating a preferred embodiment of the improved noise suppression system 
according to the present invention; 
so Figure 2 is a graph representing voice metric values output as a function of SNR estimate index values input for 

the voice metric calculator block of Figure 1 ; 

Figure 3 Is a representative gain table graph illustrating the overall channel attenuation for particular groups of 
channels as a function of the SNR estimate; and 

Figures 4a through 4f are flowcharts illustrating the specific sequence of operatrons performed in accordance with 
55 the practice of the preferred embodiment of the present invention; 

Figures 5a and 5b are flowcharts illustrating the general sequence of operations performed in the prbr art; 
Figures 6a to 6d are detailed flowcharts illustrating specific sequences of operations shown in Figures 5a and 5b. 
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Detailed Description of the Preferred Embodiment 

Figure 1 is a detailed block diagram of the preferred embodiment of the present invention. All the elements of 
Figure 1 having reference numerals less than 600 correspond to those of U.S. Patent No.4,628,529 - Borth et al., and 
5 are shown in Figures 5a and 5b and Figures 6a to 6d of the present application. The additional circuit components 
having reference numerals greater than 600 represent the improvements to the system, and will be subsequently 
described herein. 

First, concerning the prior art Borth patent U.S. 4,629,529 (and now expressly referring to Figures 5a and 5b and 
Figures 6a to 6d), FIG. 5a/b is a flowchart illustrating the overall operation of the prior art noise suppression mechanism. 
10 This generalized flow diagram is subdivided Into three functional blocks: noise suppression loop 604 - further described 
in detail in FIG. 6a; automatic gain selector 61 5 - described in more detail in FIG. 6b; and automatic background noise 
estimator 621 - illustrated in FIGS. 7c and 76. 

The operation of the improved noise suppression system begins with FIG. 5a at initialization block 601 . When the 
system is first powered-up. no old background noise estimate exists in energy estimate storage register 585, and no 
IS noise energy history exists In energy valley detector 570. Consequently, during initialization 601 , storage register 585 
is preset with an Initialization value representing a background noise estimate value corresponding to a clean speech 
signal at the input: Similarly, energy valley detector 570 is preset with an initialization value representing a valley level 
corresponding to a noisy speech signal at the input. 

Initialization block 601 also provides initial sample counts, channel counts, and frame counts. For the purposes of 
20 the following discussion, a sample period is defined as 125 microseconds corresponding to an 8 Khz sampling rate. 
The frame period is defined as being a 10 millisecond duration time interval to which the input signal samples are 
quantized. Thus, a frame corresponds to 80 samples at an 8 Khz sampling rate. 

Initially, the sample count is set to zero. Block 602 increments the sample count by one, and a noisy speech sample 
is input from A/D converter 510 in block 603. The speech sample is then pre-emphasized by pre-emphasis network 
25 520 in block 605. 

Following pre-emphasis, block 606 initializes the channel count to one. Decision block 607 then tests the channel 
count number. If the channel count is less than the highest channel number N, the sample for that channel is bandpass 
filtered, and the signal energy for that channel is estimated in block 608. The result is saved for later use. Block 609 
smoothes the raw channel gain for the present channel, and block 610 modifies the level of the bandpass-filtered 
30 sample utilizing the smoothed channel gain. The N Channels are then combined (also in block 610) to form a single 
processed output speech sample. Block 611 increments the channel count by one and the procedure in blocks 607 
through 611 is repeated. 

If the result of the decision in 607 is true, the combined sample is de-emphasized in block 612 and output as a 
modified speech sample in block 61 3. The sample count is then tested in block 61 4 to see if all samples in the current 

35 frame have been processed. If samples remain, the loop consisting of blocks 602 through 61 3 is re-entered for another 
sample. If all samples in the current frame have been processed, block 614 initiates the procedure if block 615 for 
updating the individual channel gains. 

Continuing with FIG. 5b, block 616 initiates the channel counter to one. Block 617 tests if all channels have been 
processed. If this decision is negative, bkx:k 618 calculates the index to the gain table for the particular channel by 

40 forming an SNR estimate. This index is then utilized in block 61 9 to obtain a channel gain value from the look-up table. 
The gain value is then stored for use in noise suppression kxDp 604. Block 620 then increments the channel counter, 
and block 617 rechecks to see if all channel gains have been updated. If this decision is affimnative, the background 
noise estimate is then updated in block 621 . 

To update the background noise estimate, US-A-4, 628,529 first simulates post-processed energy in block 622 by 

45 multiplying the updated raw channel gain value by the pre-processed energy estimate for that channel. Next, the 
simualted post-processed energy estimates are combined In btock 623 to form an overall channel energy estimate for 
use by the valley detector Block 624 compares the value of this overall post-processed energy estimate to the previous 
valley level. If the energy value exceeds the previous valley level, the previous valley level is updated in block 626 by 
increasing the level with a slow lime constant. This occurs when voice, or a higher background noise level, is present. 

50 If the output of deciston bkxk 624 is negative (post-processed energy less than previous valley level), the previous 
valley level is updated in block 625 by decreasing the level with a fast time constant. This previous valley level decrease 
occurs when minimal background noise is present. Accordingly, the background noise history is continually updated 
by slowly increasing or rapidly decreasing the previous valley level towards the current post-processed energy estimate. 
Subsequent to the updating of the previous valley level (block 625 or 626), decision block 627 tests if the current 

55 post-processed energy value exceeds a predetermined noise threshold. If the result of this comparison is negative, a 
decision that only noise is present is made, and the background noise spectral estimate is updated in block 628. This 
corresponds to the closing of channel switch 575. If the result of the test is affirmative, indicating that speech is present, 
the background noise estimate is not updated. In either case, the operatbn of background noise estimator 621 ends 
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when the sample count is rest in block 629 and the frame count is incremented in block 630. Operation then proceeds 
to block 602 to begin noise suppression on the next frame of speech. 

The flowchart of FIG. 6a illustrates the specific details of the sequence of operation of noise suppression loop 604. 
For every sample of input speech, block 701 pre-emphasizes the sample by implement the fitter described by the 
s equation: 

Y(nT)=X(nT)-K,[X((n-1)T)] 

10 where Y(nT) is the output of the filter at time nT, T is the sample period, X(nT) and X((n-1 )T) are the input samples at 
times nT and (n-1)T respectively, and the pre-emphasis coefficient is 0.9375. As previously noted, this fitter pre- 
emphasizes the speech sample at approximately +6 dB per octave. 

Block 702 sets the channel count equal to one, and initializes the output sample total to zero. Block 703 tests to 
see is the channel count Is equal to the total number of channels N. If this decision is negative, the noise suppression 
IS loop begins by filtering the speech sample through the bandpass filter corresponding to the present channel count. As 
noted earlier, the bandpass filters are digitally implemented using DSP techniques such that they function as 4-pole 
Butterworth bandpass filters. - 

The speech sample output from bandpass filter(cc) is then full-wave rectified in block 705, and low-pass filtered 
in block 706, to obtain the energy envelope value E(^) for this particular sample. This channel energy estimate is then 
20 stored by block 707 for later use. As will be apparent to those skilled in the art, energy envek>pe value is actually 
an estimate of the square root of the energy in the channel. 

Block 708 obtains the raw gain value RG for channel cc and performs gain smoothing by means of a first order 
MR filter, implementing the equation: 

G(nT)=G((n-1 )T)+K2(cc)(RG(nT)-G(n-1 )T) 

where G(nT) is the smoothed channel gain at time nT, T Is the sample period, G((n-1)T) is the smoothed channel gain 
at time (n-1)T, RG(nT) is the computed raw channel gain for the last frame period, and K2(cc) is the filter coefficient 

30 for channel cc. This smoothing of the raw gain values on a per-sample basis reduces the discontinuities in gain changes, 
thereby significantly improving noise flutter performance. 

Block 709 multiplies the filtered sample obtained in block 704 by the smoothed gain value for channel cc obtained 
from bkxk 708. This operation modifies the level of the bandpass filtered sample using the current channel gain, 
corresponding to the operation of channel gain modifier 250. Block 710 then adds the modified filter sample for channel 

35 cc to the output sample total, which, when performed N times, combines the N modified bandpass filter outputs to form 
a single processed speech sample output. The operation of block 710 corresponds to channel combined 260. Block 
711 increments the channel count by one and the procedure in blocks 703 through 711 is then repeated. 

If the result of the test in 703 Is true, the output speech sample is de-emphasized at approximately -6 dB peroctave 
in block 71 2 according to the equation: 

40 

Y(nT)=X(nT)+K3[Y((n.1)T)] 

where X(nT) is the processed sample at time nT, T is the sample period, Y(nT) and Y((n-1 )T) are the de-emphasized 
45 speech samples attlmes nTand (n-1 )T respectively, and K3 Is the de-emphasis coefficient which has a value of 0.9375. 
The de-emphasized processed speech sample is then output to the D/A converter block 61 3. Thus, the noise suppres- 
sion loop of FIG. 6a illustrates both the channel filter-bank noise suppression technique and the per-sample channel 
gain smoothing technique. 

The flowchart of FIG. 6b more rigorously describes the- detailed detailed operation of automatic gain selector block 
so 614 of FIG. 5. Following processing of all speech samples In a particular frame, the operation is turned over to block 
615 which serves to update the indivtdaul channel gains. First of all, the channel count (cc) is set to one in block 720. 
Next, decsion block 721 tests if all channels have been processed. If not, operation proceeds with block 722 which 
calculates the signal-lo-noise ratio for the particulal channel. As previously mentioned, the SNR calculation is simply 
a division of the per-channel energy estimates (signal-plus-noise) by the perchannel background noise estimates 
55 (noise). Therefore, block 722 simply divides the current stored channel energy estimate from block 707 by the current 
background noise etimate from block 628 according to the equation: 
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Index (cc)=[current frame energy for channel cc)/background 
noise energy estimale for channel ccj. 

5 

In block 723. the particular gain table to be indexed Is chosen. The quantized value of the current valley level is 
used to perform is selection. However, any method of gain table selection may be used. Furthermore, no gain table 
selection is required for noise suppression systems implementing a single gain table. 

The SNR index calculated in block 722 is used in block 724 to look up the raw channel gain value from the appro- 
10 priate gain table. Hence, the gain value is indexed as a functbn if two or three variables: (1) the channel number; (2) 
the current channel SNR estimate; and possibly (3) the overall average background noise level. 

Block 725 stores the raw gain value chosen by block 724. The channel count is incremented in block 726, and 
then decision block 721 is re-entered. After alt N channels gains have been updated, operation proceeds to block 621 . 
Hence, automatic gain selector block 615 updates the channel gain values on a Irame-by-frame basis to more accu- 
is rately reflect the current SNR of each particular channel. 

FIG. 6c and FIG. 6d expands upon block 621 to more specifically describe the function of a prior art automatic 
background noise estimator. Particularly, FIG. 6c describes the process of simulating the post-processed energy and 
combining these estimates, while FIG. 6d describes the operation of valley detector 570. 

Referring now to FIG. 6c, the operation for simulating post-processed speech begins at block 730 by setting the 
20 channel count (cc) to one. Block 731 tests this channel count to see if all N channels have been processed. If not, the 
equation of block 732 describes the actual simulation process performed by a prior art energy estimate modifier 

Simulated post-processed speech energy is generated by multiplying the raw channel gain values (obtained directly 
from the channel gain tables) by the pre-processed energy estimate (obtained from channel energy estimator 220) for 
each channel via the equatbn: 

25 

SE(cc)=E(cc) RG(cc) 



where SE(cc) is the simulated post-processed energy for channel cc, E(cc) is the current frame energy estimate for 
30 channel cc stored by block 707, and RG(cc) is the raw channel gain value for channel cc obtained from block 725. As 
noted earlier, E(cc) is actually the square root of the energy in the channel since it is a measure of the signal envelope. 
Hence, the RG(cc) term of the above equation is not squared. The multiplication performed in block 732 serves es- 
sentially the same function as channel gain modifier 250 - except that the channel gain modifier utilizes pre-processed 
speech s/gna/ whereas energy estimate modifier 560 utilizes pre-processed speech energy. 
3S The channel counter is then incremented in block 733, and retested in block 731 . When a simulated post-processed 

energy value is obtained for all N Channels, blocks 734 through 738 serve to combine the individual simulated channel 
energy estimates to form the single overall energy estimate according to the equation: 



N 

X CHANNEL (i) POST-PROCESSED ENERGY 
i=l 



45 where N is the number of filters in the filter bank. Block 734 initializes the channel count to one, and block 735 initializes 
the overall post-processed energy value to zero. After initialization, decision block 736 tests whether or not all channel 
energies have been combined. If not, block 737 adds the simulated post-processed energy value for the current channel 
. to the overall post-processed energy value. The current channel number is then incremented in block 738, and the 
channel number is again tested at block 736. When all N channels have been combined to form the overall simulated 

so post-processed energy estimate, operation proceeds to block 740 of FIG. 6d, 

Referring now to FIG. 6d, blocks 740 through 745 illustrate how the post-processed signal energy is used to gen- 
erate and update the previous valley level, corresponding to the operation of a prior art energy valley detector. After 
all the post-processed energies per channel have been combined, block 740 computes the logarithm of this combined 
post-processed channel energy. One reason that the log representatbn of the post processed speech energy is used 

55 in the embodiment is to facilitate implementation of an extremely large dynamic range (>90 dB) signal in an 8-bi1 
microprocessor system. 

Decision block 741 then tests to see if this log energy value exceeds the previous valley level. As previously 
mentioned, the previous valley level is either the stored valley level for the prior frame or an initialized valley level 
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provided by block 601 of FIG. 6. If the log value exceeds the previous valley level, the previous valley level is updated 
in block 743 with the current log [post-processed energy] value by increasing the level with the slow tinne constant of 
approximately one second to fomn a current valley level. This occurs when voice or a higher background noise level 
is present, Conversely, if the output of decisbn block 741 is negative (log[post-processed energy] less than previous 

s valley level), the previous valley level is updated in block 742 with the current log [post-processed energy] value by 
decreasing the level with a fast time constant of approximately 40 milliseconds to form the current valley level. This 
occurs when a bwer background noise level is present. Accordingly, the background noise history is continuously 
updated by slowly increasing or rapidly decreasing the previous valley level, depending upon the background noise 
level of the current simulated post-processed speech energy estimate. 

10 After updating the previous valley level, decision block 744 tests if the current log [post-processed energy] value 

exceeds the current valley level plus a pre-detemained offset. The addition of the current valley level plus this valley 
offset produces a noise threshold level. This offset provides approximately a 6 dB increase to the currant valley level. 
Hence, another reason for utilizing log arithmetic is to simplify the constant 6 dB offset addition process. 

If the log energy exceeds this threshold - which would correspond to a frame of speech rather than background 

IS noise - the current background noise estimate is not updated, and the background noise updating process terminates. 
If, however, the log energy does not exceed the noise threshold level - which would correspond to a detected minima 
in the post-processed signal indicating that only noise is present - the background noise spectral estimate is updated 
in block 745. This corresponds to the closing of channel switch 575 in response to a positive valley detect signal from 
energy valley detector 570. This updating process consists of providing a time-averaged value of the pre-processed 

20 channel energy estimate for the particular channel by smoothing the estimate (in smoothing filter 580), and storing 
these time averaged values as per-channel noise estimates (in energy estimate storage register 585). The operation 
of background noise estimator bkxk 621 ends for the particular frame being processed by proceeding to block 629 
and 630 to obtain a new frame. 

Now, to describe the operation and structure of the present invention in detail (and referring again to FIG. 1). 

25 improved noise suppression 800 incorporates changes to the aforementioned Borth noise suppressran system in three 
basic area: (a) the updating of background noise estimates by voice metric calculator 810; (b) the modification of SNR 
estimates by channel SNR modifier 820; and (c) utilization of SNR threshold block 830 to offset the gain rise of each 
channel. Each of these improvements will be described in terms of the block diagram of Figure 1, and in the terms of 
the flowchart of Figure 4a - 4f. 

30 Voice metric calculator 81 0 replaces the valley detector circuitry of the previous system. A voice metric is essentially 

a measurement of the overall voice-like characteristics of all the channel energies. In the preferred embodiment, voice 
metric calculator 810 is implemented as a kx3k-up table which translates the individual channel SNR estimates at 235 
into voice metric values. The voice metric values are used internally to determine when to update the background noise 
estimate, by closing channel switch 575 for one frame. As used herein, updating the background noise estimate is 

3S defined as partially modifying the old background noise estimate with a new estimate using, for example, a 10%/90% 
. - new-to-old estimate ratio. The voice metric values are also used in the channel SNR modifying process as will subse- 
quently be described. 

From the perspective of nnaking a background noise update decision, a frame having high energy, whk;h is typically 
indicative of a speech frame, could also mean that a narrowband noise transient or a sudden increase in the background 

40 noise level has occurred. Therefore, the present invention characterizes the frame energy as a voice metrk: sum, 
VMSUM. and utilizes this multi-channel energy parameter to perform the updating decision. The process utilizes a 
voice metric table which may be represented as a curve as shown in figure 2. 

Figure 2 is a graph illustrating the characteristk; curve of the voice metrics for a particular channel, the horizontal 
axis represents SNR estimate indices. Each SNR estinriate index value represents three-eighths (3/8) dB signal-to 

4S noise ratio. Hence, an SNR estinnate index of 10 represents 3.75 dB SNR. The vertical axis represents voice metric 
values VM(CC) for each of the N Channels. Note that a voice metric of 2 is produced for an SNR index ot 1 . Also note 
that the curve is not linear, since a channel energy has more voice-like characteristics at higher SNR's. 

First, the raw SNR estimates are used to index into the voice metric table to obtain a voice metric value VM(CC) 
for each channel. Second, the individual channel voice metric values are summed to create the total of all individual 

50 channel voice metric values, called the voice metric sum VMSUM. Third, VMSUM is compared to an UPDATE THRESH- 
OLD representative of a voice metric total that is deemed to be noise. If the multi-channel energy parameter VMSUM 
is less that the UPDATE TH RESHOLD, the particular frame has very few voice-like characteristics, and is most probably 
noise. Therefore, a background noise update is performed by closing channel switch 575 for the particular frame. The 
most recent voice metric sum VMSUM is also made available to channel SNR modifier 820 via line 815 for use in the 

55 modification algorithm. 

In the preferred embodiment, the UPDATE THRESHOLD is set to a total voice metric sum value of 32. Since the 
minimum value in the voice metric table is 2. the minimum sum for 14 channels is 28. The voice metric tables values 
remain at 2 until an SNR index of 12 (or 4.5 dB SNR) is reached. This means that an increased level of broadband 
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noise (individual channels each having SNR values not greater that 4.125 dB) will still generate a sum of 28. Since the 
UPDATE THRESHOLD of 32 would not then be exceeded, the broadband noise voice metric will bo correctly classified 
as noise, and a background noise update will be' performed. Conversely, any single channel having an SNR index 
value greater than 24 (or at least 9.0 dB SNR) would cause the VMSUM to exceed the UPDATE THRESHOLD, and 

5 result in a voice or narrowband noise burst decision. 

Many variations of the voice metric table are possible, as different types of metrics may be compensated for by 
the proper selection of the UPDATE THRESHOLD. Furthermore, the sensitivity of the speech/noise decision may also 
be chosen for a particular application. For example, in the preferred embodiment, the threshold may be adjusted to 
accommodate any single channel having an SNR value as sensitive as 4.5 dB to as Insensitive as 15 dB. The corre- 

10 sponding UPDATE THRESHOLD would then be set within the range of 29 to 41 . 

In addition to performing the speech/noise decision utilizing voice metrics, voice metric calculator 81 0 keeps track 
of the time that has expired since the last background noise update. An update counter is tested on each frame to see 
if more than a given number of names, each representing a predetemnined time, has passed since the previous update. 
In the preferred embodiment utilizing 10 millisecond frames, if the update counter reaches 100 --corresponding to a 

IS timing threshold of 1 second without updates- an update is performed regardless of the voice metric decision. However, 
any timing threshold within the range of 0.5 second to 4 seconds would be practical. As previously mentioned this 
timing parameter test is used to prevent any sudden, large Increases in noise level from being indefinitely interpreted 
as voice. 

The basic function of channel SNR nrxxlifier 820 is to eliminate the detrimental effects of narrowband noise bursts 
20 on the noise suppression system. A narrowband noise burst may be defined as a momentary increase in channel 
energy for only a few channels. In the preferred embodiment, a high energy level above a 6 dB SNR threshold in fewer 
than 5 of the upper 10 channels is classified as a narrowband noise burst. Such a noise burst wouki normally create 
high gain values for only a few number of channels, which results in the "running water" type of background noise 
flutter described above. 

25 Raw SNR estimates at 235 are applied to the input off channel SNR modifier 820, and modified SNR estimates 

are output at 825. Basically, SNR modifier 820 counts the number of channels which have channel SNR index values 
which exceed an index threshold. In the preferred embodiment, the index threshold is set to correspondto an SNR 
value within the range of 4 dB to 10 dB, preferably 6 dB SNR. If the number of channels is below a predetermined 
count threshold, then the decision to modify the SNR's is made. The count threshold repr.esents a relatively few number 

30 of channels, i.e., not greater than 40% of the total number of channels N. In the preferred embodiment, the count 
threshold is set to 5 of the 10 measured channels. During the modification process itself, channel SNR modifier 820 
either reduces the SNR of only those particular channels having an SNR index less than a SETBACK THRESHOLD 
(indicative of a narrowband noise channel), or reduces the SNR of all the channels if the voice metric sum is less than 
a metric threshold (indicative of a very weak energy frame). Hence, the channels containing the narrowband noise 

3S burst are attenuated so as to prevent them from detrimentally affecting the gain table look-up function. 

. . , . ..SNR threshold block 830 proves a predetermined SNR threshold for each channel which must be exceededby 
the modified channel SNR estimates before a high gain value can be produced. Only SNR estimates which have a 
value above the SNR threshold are directly applied to the gain table sets. Therefore, snnall background noise fluctuation 
are not albwed to produce gain values which represent voice. This implementation of an SNR threshold essentially 

40 presents an offset in the gain rise for channels having low signal-to-noise ratio. Preferably, the SNR threshold would 
be set within the range of 1 .5 dB to 5 dB SNR to eliminate minor noise fluctuations. The SNR threshold may be imple- 
mented as a separate element as shown in Figure 1 , or it may be implemented as a "dead zone" in the characteristic 
gain curve for each gain table set 590. 

Figure 3 graphrcally illustrates the function of SNR threshold block 830, as well as the attenuation function of the 

45 channel gain values in each gain table set. On the horizontal axis, modified SNR estimates are shown in dB as would 
be output from channel SNR modifier 820 at 825. The vertk;al axis represents the channel gain (attenuation) as would 
be obsen^ed at the output of channel gain modifier 250 at 255. A maximum amount of background noise attenuation 
is achieved for channels having a minimum gain value. Note that SNR threshold block 830 is shown as "dead zone" 
or offset in the gain rise cun/e of approximately 2.25 dB. Hence, an SNR estimate must exceed this threshold before 

50 the channel gain can rise above the minimum gain level shown. Also note that two curves are illustrated, each having 
a different minimum gain level. Upper curve labeled A group represents a low channel group, e.g., consisting off channels 
1-4 in the preferred embodiment, while group B represents the higher frequency channels 5-14. 

As evident from the graph, the k>w frequency channels have a minimum gain value of-13.1 dB, while the upper 
frequency channels have a minimum gain value of -20.7 dB. It has been found that less voice quality degradation 

ss occurs wrtien the channels are divided into such groups. Although only two different gain cun/esare used in the preferred 
embodiment for gain table set number 1 , it may prove advantageous to provide each channel with a different charac- 
teristic gain curve. Furthermore, as explained in the referenced Borth patent, multiple gain table sets are used to allow 
a wider choice of channel gain values depending on the particular background noise environment. Noise level quantizer 



9 

03/28/2003, EAST Version: 1.03.0002 



EP 0 380 563 B1 



555 utilizes hysteresis to select a particular gain table set based upon thie overall background noise estimates. The 
gain table selection signal, output from noise level quantizer 555, is applied to gain table switch 595 to implenrient the 
gain table selection process. Accordingly, one of a plurality of gain table sets 590 may be chosen as a function of 
overall average background noise level. 

s These noise suppression improvements eliminate the variability of the background noise suppression without re- 

quiring a large amount of gain smoothing. Background noise attenuation within the range of 1 0 dB to 25 dB is readily 
achieved with the present invention. With the improvements, the system requires gain smoothing having a time constant 
of only 10 to 20 milliseconds to obtain a flat or "white" residual noise background. Previous techniques required 40 to 
60 millisecond time constant gain smoothing, which not only resulted in imperfect flutter reduction, but also substantially 

10 degraded the voice quality. 

Since the overall operation of the improved noise suppression system is similar to that described in the previous 
descriptluon of the prior art US-A-4,628,529, the generalized flow diagram illustrated in figures 5a/b of this applicatbn 
are again used to describe the present invention. The general organizatbn of the operation of the present invention 
may still be organized in three functional groups: noise suppression loop - sequence bk)ck604 of previously described 

75 prior art figure 5a, whch is described In greater detail in prior art Figure 6a of the present application; automatic gain 
selector - sequence 61 5 of previously described prior art figure 5b, which has been modified for the present inventbn; 
- - and automatic background noise estimator - sequence 621 of prior art Figure 5b, which has also been modified in the - 
present invention. The detailed flowcharts of Figure 4a through 4f of the present application may be substituted for 
sequence blocks 61 5 and 621 of prior art Figure 5b to describe the operation of improved noise suppression system 

20 800. Hence, prior art Figure 5a and 6a of the previously described Borth patent (4,628,529) describe the noise sup- 
pression loop performed on a sample-by-sample basis, while Figures 4a through 4f of the present invention describe 
the channel gain selectk>n process and the background noise estimate update process performed on a trame-by-f rame 
basis. 

Referring now to Figure 4a, the operation of improved noise suppressbn system 800 begins from the "YES" output 
25 of decision step 614 of the aforementioned Figure 6a. Hence the actual spectral gain modification function for the 
particular frame has already been performed on a sample-by-sample basis utilizing gain values from the previous 
frame. Sequence 850 serves to generate the SNR estimates available at 235, First of all, the channel count CC is set 
equal to 1 in step 851 . Next, the voice metric sum variable VMSUM is initialized to zero in step 852. Step 853 calculates 
the raw signal-to-noise ratra SNR for the partrcular channel as an SNR estimate index value INDEX(CC). The SNR 
30 calculation is simply a division of the per-channet energy estinr^tes (signal-plus-noise) available at 225, by the per- 
channel background noise estimates (noise) at 325. However, other estimates of the signal-to-noise threshokj may 
alternatively be used. Therefore, step 853 simply divides the current stored channel energy estimate (obtained from 
flowchart step 707 of the aforementioned Figure 7a) by the current background noise estimate BNE(CC) from the 
previous frame. 

35 In sequence 860, the voice metrics are calculated. First, the voice metric table for the particular channel is indexed 
. in step 861 using the raw SNR estimate index INDEX(CC). The vok;e metric table is read in step 862 to obtain a voice 
metric value VM(CC) for that particular channel. This individual channel voice metric value is added to the voice metric 
sum VMSUM in step 863. The channel count CC is incremented in step 864, and tested in step 865. If the voice metrics 
for all N channels have not been calculated, control retums to step 853. 

'io Sequence 870 Illustrates the background noise estimate update decision process performed by voice metric cal- 

culator 81 0. The voice metric sum VMSUM is compared to UPDATE THRESHOLD in step 871 . If VMSUM is less than 
or equal to UPDATE THRESHOLD, then the frame is probably a noise frame, TIMER FLAG is reset in step 872, and 
the update counter UC is reset in step 873. Control proceeds to 878 where the UPDATE FLAG is set true, which means 
that a background noise estimate update will be performed for the current frame. 

45 If VMSUM is greater that the UPDATE THRESHOLD, the frame is probably a voice frame. Nevertheless, step 874 

tests the TIMBER FLAG to see if a sudden, toud increase in background noise has been interpreted as speech. If the 
TIMER FLAG is true, the one second time interval was exceeded a number of frames ago, and background noise 
estimate updating is still required. This Is due to the fact that only a partial background noise update is performed for 
each frame. If the TIMER FLAG is not true, the update counter UC is incremented in step 875, and tested in step 876. 

50 If 100 frames have occurred since the last background noise estinaate update, the TIMER FLAG is set true in step 877, 
and the BNE UPDATE FLAG is set true in step 878. A series of partial background noise estimate updates are then 
performed until the voice metric sum VMSUM again falls below the UPDATE THRESHOLD. Note that the only place 
in the flowchart that the TIMBER FLAG is reset is in step 872, when the voice metric sum VMSUM again resembles 
noise. If the update counter UC has not reached 100 frames, the instant frame Is deemed to be a voice frame, and no 

55 background noise update is performed. 

Referring now to sequence 880 of Figures 4b and 4c, the decision to nnodify the channel signal-to-noise ratios is 
performed next. An index counter variable IC is initialized in step 881 . The channel counter CC is set equal to 5 in step 
882, so as to count only the upper 10 of the 14 channels having a high energy The raw SNR estimate index INDEX 
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(CC) is tested in step 883 to see if it has reached an INDEX THRESHOLD which would correspond to approximately 
6 dB SNR. Here, the assumption is made that at least 5 of the upper 10 channels ot a voice frame should contain 
energy having an SNR of at least 6 dB. If the particular channel SNR INDEX(CC) is above the INDEX THRESHOLD, 
the index count IC is incremented In step 884. If not, the channel count CC is incremented in step 885 and tested in 

5 step 886 to look at the next channel. ^ 

When all 10 upper channels have been measured, index count IC represents the number of channels having an 
SNR estimate index higher than the INDEX THRESHOLD. The index count IC is then tested against a COUNT 
THRESHOLD in step 887. If IC indicates that more channels than the COUNT THRESHOLD, e.g., 5 of the upper 10 
channels, contain sufficient energy, then the frame is probably a voice frame, and the MODIFY FLAG is set false in 

10 step 889 to prevent channel SNR modification. If only a few channels contain high energy, which would be represent- 
ative of a frame of narrowband noise, then the MODIFY FLAG Is set true in step 888. 

Sequence 890 describes the SNR modification process performed by channel SNR modifier block 820. Initially, 
the MODIFY FLAG is tested in step 891 . If it is false, the channel SNR modification process Is by^Dassed. If the MODIFY 
FLAG is true, the channel counter CC Is initialized in step 892. Next, each channel SNR estimate index is tested in 

IS step 893 to see if it is less than or equal to a SETBACK THRESHOLD. The SETBACK THRESHOLD, which may have 
a value corresponding to 6 dB SNR, represents the maximum SNR estimate which is representative of background 

noise flutter. Only channels having low SNR estimate index pass this test. However, even if the channel index is greater 

than the SETBACK THRESHOLD, the voice metric sum VMSUM is again tested in step 894. If VMSUM is less than 
or equal to a METRIC THRESHOLD, which corresponds to a representative total voice metric of a narrowband noise 

20 frame, the INDEX (CC) is modified in step 895 by setting it equal to the minimum index value of 1 . The channel counter 
CC is incremented in step 896 and tested in step 897 to see if all the channels have been tested. If not, control returns 
to step 893 to test the next channel index. Hence, a frame containing either channel energy fluctuations or narrowband 
noise is modified such that the frame does not produce undesirable gain variations. 

Sequence 900 performs the functran of SNR threshold block 830. The channel counter CC is initialized in step 

25 901. The SNR index for the particular channel is tested against an SNR THRESHOLD in step 902. In the preferred 
embodiment, the SNR THRESHOLD represents an index value corresponding to 2.25 dB SNR. If INDEX (CC) is above 
the SNR THRESHOLD, it may be used to index the gain table. If not, the index value is again set equal to 1 in step 
903, which represents the minimum index value. The channel counter CC is incremented in step 904 and tested in 
step 905. This SNR threshold testing process serves to reduce minor background noise variations in all the channels. 

30 Referring now to sequence 91 0 of Figure 4d, the gain table sets are chosen by noise level quantizer 555 and gain 

table switch 595. In step 91 1 , the channel counter CC is Initialized, and in step 91 2, a variable called background noise 
estimate sum, BNESUM, is intialized. In step 913, the current background noise estimate BNE(CC) is obtained for 
each channel, and added to BNESUM in step 914. Step 915 increments the channel counter CC, and step 916 tests 
the channel counter to see if the background noise estimates for all N channels have been totaled. 

35 In step 917, BNESUM is compared to a first background noise estimate threshold. If it is greater than BNETHRESH- 

..OLD 1, then.gain table set number 1 is selected In step 918. Similarly, step 919 again . tests BNESUM to see if it Is . 
greater than the lower value of BNE THRESHOLD 2. If BNESUM is greater than BNE THRESHOLD 2 but less than' 
BNE THRESHOLD 1, then gain table set number 2 is selected in step 920. Otherwise, gain table set number 3 is 
selected in step 921 . Hence, gain table sets 590 are selected as a function of overall average background noise level. 

"^0 Sequence 930 describes the steps for obtaining raw gain values RG (CC) from the gain table sets 590. Step 931 

sets the channel counter CC equal to 1 . The selected gain table is indexed in steo 932 using the channel SNR estimate 
index INDEX(CC) which has passed the SNR modification and threshold tests. The raw gain value RG(CC) is obtained 
from the selected gain table in step 933. and is then stored in step 934 for use as the gain values for the next frame 
of noise suppression. The channel counter CC is incremented in step 935, and tested in step 936 as before. As de- 

45 scribed in U.S. Patent No. 4,630,305, the raw gain values for each channel at 535 are then applied to gain smoothing 
filter 530 for smoothing on a per-sample basis. 

Finally, sequence 940 describes the actual background noise estimate updating process performed in block 420 
of Figure 1 . Step 941 initially tests the UPDATE FLAG to see if a background noise estimate should be performed. If 
the UPDATE FLAG is false, then the frame is a voice frame and no background noise update can occur Otherwise, 

50 the background noise update is performed - which is simulated by closing channel switch 575 - during a noise frame. 
In step 942, the UPDATE FLAG is reset to false. Steps 942 through 945 sen^e to update the current background noise 
estimate in each of the N channels via the equation: 

55 E(i,k) = E(u,k-1) +SF[(PE(i)-E(i,k.1)], 

1=1.2 , N 

where E(i,k) is the current energy noise esfimate for channel (i) at time (k). E(i. k-l) is the oW energy noise estimate ] 
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for channel (i) at time (k-l), PE(i) is the current pre-processed energy estinnate for channel (I), and SF is the smoothing 
factor time constant used in smoothing the background noise estimates. Therefore. E(i, k-l) is stored in energy estimate 
storage register 585. and the SF temi performs the function of smoothing filter 580. In the present embodiment, SF is 
selected to be 0.1 for a 1 0 millisecond frame duration. 

B step 943 Initializes the channel count CC to 1. Step 944 performs the above equatbn in terms of the current 

background noise estimate available at 325, the old background noise estimate available at 325, the old background 
noise estimate OLD BNE (CC) stored in energy estimate storage register 585, and the new background noise estimate 
NEW BNE(CC) available from switch 575. Step 945 increments the channel counter CC, and step 946 tests to see of 
all N channels have been processed. If true, the background noise estimate update Is completed, and operation Is 

10 relumed to step 629 of Figure 6b of the aforementioned Borth patent to reset the sample counter and increment the 
frame counter. Control then returns to perfonm noise suppression on a sample-by -sample basis for the next frame. 

In review, it can now be seen that the present invention provides the following improvements: (a) a reduction in 
background noise flutter by offsetting the gain rise of the gain tables until a certain SNR value is obtained; (b) immunity 
to narrowband noise bursts through modification of the SNR estinnates based on the voice metric calculation and the 

IS channel energies; and (c) more accurate background noise estimates via performing the update decision based on 
the overall voice metric and the time interval since the last update. 

While specific embodiments of the present inventbn have been shown and described herein, further modification - 
and Improvements may be made by those skilled in the art. For example, the operational flow is described herein as 
performed in real time. However, due to inherent hardware limitations, previous background noise estimates for channel 

20 gain values may be stored for use in the next frame. All such modification which retain the basic underlying principles 
disclosed and claims herein are within the scope of this invention. 



Claims 

25 

1. A noise suppression system (800) for attenuating the background noise from a noisy input signal (205) to produce 
a noise-suppressed output signal (265), said noise suppression system comprising: 



means (210) for separating the input signal (205) into a plurality of pre-processed signals representative of 
30 selected frequency channels; 

means (220) for generating estimates of the signal-plus-noise energy and the noise energy in each individual 
channel; and 

means (310, 810) for producing a gain value for each individual channel in response to said channel energy 
estimates; 

35 

the system (800) characterized in that:. 



said gain values have a minimum gain value for each channel; 

said means (310, 810) for producing a gain value includes threshold means for allowing gain values above 
40 said minimum gain value to be produced only when said signal-plus-noise energy estimates exceed said noise 

energy estinnates by a predetermined amount; and 

means for modifying the gain of each of said plurality of pre-processed signals in response to saki gain values 
to provide a plurality of post-processed signals. 



45 2. The noise suppression system according to claim 1 , wherein said means (310, 810) for producing a gain value 
produces gain values based upon the signal-to-noise ratio (SNR) of said channel energy estimates, and wherein 
said SNR estimates are compared with a predefined SNR threshold such that channels having SNR estimates 
below said SNR threshold produce minimum gain values. 



50 3. The noise suppression system according to claim 2, wherein sakl predefined SNR threshold corresponds to an 
SNR value within the range of 1.5 dB to 5 dB SNR. 



4. The noise suppression system according to claim 3, wherein said predefined SNR threshold corresponds to an 
SNR value of approximately 2.25 dB SNR. 

55 

5. The noise suppression system according to any preceding claim, wherein said gain modifying means provides a 
maximum amount of attenuation of the pre-proposed signal in a particular channel having a minimum gain value. 
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6. The noise suppression system according to any preceding claim, wherein gain values produce a higher amount 
of attenuation for high frequency channels than low frequency channels. 

7. The noise suppression system according to any preceding claim 1 , wherein said means (310, 810) for producing 
s a gain value further includes a plurality of gain tables, each gain table having predetenmined Individual channel 

gain values corresponding to said Individual channel energy estimates, and gain table selection means for auto- 
matically selecting one of said plurality of gain tables as a function of the overall average background noise level 
of said Input signal. 

10 8. The noise suppression system according to any preceding claim, further including means for combining said plu- 
rality of post-processed signals to produce said noise-suppressed output signal. 

9. The noise suppression system of any preceding claim, wherein said means (220) for gemating estimates includes: 

IS means for generating and storing an estimate of the background noise power spectral density of said pre- 

processed signals, said background noise estimate generating means Including means for modifying said 
background noise estimate in response to a timing parameter indicative of the time interval since the previous 
background noise estimate modification; and 

means for generating an estimate of the signal-to-nolse ration (SNR) In each individual channel based upon 
20 said modified background noise estimates; 

wherein said means (310, 810) for producing a gain value for each individual channel produces each gain 
value in response to said channel SNR estimates. 

10. The noise suppression system according to claim 9, wherein said background noise estimate modifying means 
25 includes means for producing said timing parameter, and means for comparing said timing parameter to a prede- 
termined timing threshold such that a background noise estimates modification is performed when said timing 
parameter exceeds said timing threshold. 

11. The noise suppressron system according to claim 10, wherein said background noise estimate modifying means 
30 further includes means for generating an estimate of the energy In each Individual channel, and means for pro- 
ducing a multi-channel energy parameter in response to the total value of all individual channel energy estimates. 

12. The noise suppression system according to claim 11, wherein said multi-channel energy parameter producing 
means accommodates for minor variations in indivkJual channel energy estimates such that said minor variations 

35 do not significantly affect saki multi-channel energy parameter. 

13. The noise suppression system according to claim 11 or 12, wherein said background noise estimates modifying 
means further includes means for comparing said multi-channel energy parameter to a predetermined energy 
threshold such that a background noise estimates modification is performed when said multi-channel energy pa- 

40 rameter is less than said energy threshold. 

14. The noise suppression system according to claim 13, wherein said background noise estimates modifying means 
modifies said background noise estimates in response to sakJ timing parameter regardless of said multi-channel 
energy parameter 

45 

15. The noise suppression system according to any one of claims 11 to 1 4, wherein sa\6 multi -channel energy param- 
eter is generated by translating said individual channel SNR estlnnates the individual channel voice metrics, the 
voice metric sum being a measurement of the overall voice-like characteristics of the energy in all channels. 

so 16. The noise suppression system according to any preceding claim, further comprising: 

means (81 5, 820, 830) for monitoring said channels energy estimates and for distinguishing narrowband noise 
bursts from speech energy and background noise energy, thereby producing a modification signal (835); 
means (590) for selectively modifying said channels energy estimates in response to said modification signal 
55 (835) such that channel energy estimates representative of narrowband noise bursts are modified; 

means for producing a gain value for each individual channel in response to each modified channel energy 
estimate; and 

means for nrxxJifying the gain of each of said plurality of pre-processed signals in response to said gain values 
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to provide a plurality of post-processed signals. 

17. The noise suppression system according to claim 16, wherein said modification signal is indicative of the total 
number of individual channels having energy estimates exceeding a predetermined energy threshold. 

5 

18. The noise suppression system according to claim 16 or 17, wherein said channel energy estimates modifying 
means includes means (830) for comparing said modification signal to a predetemnined count threshold such that 
a channel energy estimates nnodification is performed when said total number of individual channels is less than 
said count threshold. 

10 

19. The noise suppression system according to claim 16, 17 or IB, wherein said gain modifying means provides a 
maximum amount of attenuation of the pre-processed signal in a particular channel having a modified channel 
energy estimate. 

15 20. A rtiethod of attenuating the background noise from a noisy input signal (205) to produce a noise-suppressed 
output signal (265) in a noise suppression system (800) comprising the steps of: 

separating (850) the input signal into a plurality of pre-processed signals representative of a number N of 
selected frequency channels; 
20 generating an estimate of the energy in each individual channel (853); 

generating and storing an estimate of the background noise power spectral density of said pre-processed 
signals; and 

generating an estimate of the signal-to-noise ratio (SNR) in each individual channel based upon said back- 
ground noise estimates and said channel energy estimates; 

25 

the method characterized by the steps of: 

producing (861 , 862) a gain value for each individual channel in response to said channel SNR estimates, 
wherein said gain values have a range of minimal values; 
30 and wherein said gain values producing step Includes the steps of: 

providing a predefined SNR threshold and comparing (902) said channel SNR estinnates to said predefined 
SNR threshold such that channels having SNR estimates below said SNR threshold produce gain values 
within said minimal range; and 

modifying (910) the gain of each of said plurality of pre-processed signals in response to said gain values to 
35 provide a plurality of post-processed signals. 



Patentanspruche 

40 1. Rauschunterdruckungssystem (800) zur Dampfung des Hintergrundrauschens von einem verrauschten Eingangs- 
signal (205), um ein rauschunterdrOcktes Ausgangssignal (265) zu erzeugen, wobei dieses Rauschunterdruk- 
kungssystem umfafM: 

• Mittel (210) zur Aufteilung des Eingangssignals (205) in eine Vielzahl von von^erarbeiteten Signalen, die aus- 
45 gewahlte Frequenzkanale darstellen; 

• Mittel (220) zur Erzeugung von Abschatzungen der Signalplus-Rausch-Energie und der Rauschenergie in 
jedem einzelnen Kanal; und 

• Mittel (310, 810) zur Erzeugung eines Verstarkungswerts fOr jeden einzelnen Kanal in Reaklion auf diese 
Kanalenergieabschatzungen; 

so 

wobei das System (800) dadurch gekennzeichnet ist, daB: 

• diese Verstarkungswerte f Or jeden Kanal einen minimalen Verstarkungswert haben; 

• diese Mittel (310. 810) zur Erzeugung eines Verstarkungswerts Schwellenwertmittel enthalten, die gestatten, 
55 da8 Verstarkungswerte oberhaib dieses minimalen Verstarkungswerts nur erzeugt werden, wenn diese Signal- 
pi us-Rausch-Energieabschatzungen diese Rauschenergieabschatzungen um einen vorbestinomten Betrag 
Oberschreiten; und 

• Mittel zur Modifizierung der Verstarkung jedes aus der Vielzahl vorverarbeiteter Signale in Reaktion auf diese 
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Verstarkungswerle, um eine Vielzahl nachverarbeiteter Signals bereitzustellen. 

2. Rauschunterdruckungssystem nach Anspruch 1 , wobei diese Mitlel (31 0, 81 0) zur Erzeugung eines Verstarkungs- 
werts aut der Grundlage des Signal-Rausch-Abstands (SNR) dieser Kanalenergieabschatzungen Verstarkungs- 
werle erzeugt und wobei diese SNR-Abschatzungen mil einem vordefinierten SNR-Schwellenwert verglichen wer- 
den. so daD Kanale, die SNR-Abschatzungen unter diesem SNR-Schwellenwert haben, minimale Verstarkungs- 
werte erzeugen. 

3. RauschunterdrQckungssystem nach Anspruch 2, wobei dieser vordetinierte SNR-Schwellenwert einem SNR- Wert 
innerhalb des Bereichs von 1,5 dB bis 5 dB SNR entspricht. 

4. Rauschunterdruckungssystem nach Anspruch 3, wobei dieser vordetinierte SNR-Schwellenwert einem SNR- Wert 
von ungetahr 2,25 dB SNR entspricht. 

5. Rauschunterdruckungssystem nach einem der vorhergehenden Anspruche, wobei diese verstarkungsmodifizie- 
renden Mittel einen maximalen Betrag der Dampfung des vorverarbeiteten Signals in einem bestimmten Kanal, 
der einen minimalen Verstarkungswert hat, gewahrleisten. 

6. Rauschunterdruckungssystem nach einem der vorhergehenden Anspruche, wobei die Verstarkungswerle einen 
groBeren Betrag der Dampfung tOr Kanale mit hohen Frequenzen als fur Kanale mit niedrigen Frequenzen erzeu- 
gen. 

7. Rauschunterdruckungssystem nach einem der vorhergehenden Anspruche, wobei diese Mittel (310, 810) zur Er- 
zeugung eines Ver&tarkungswerts welter eine Vielzahl von Verstarkungstabellen enthalten, wobei jede Verstar- 
kungstabelle vorbestimmte einzelne Kanalverstarkungswerte entsprechend dieser einzetnen Kanalenergieab- 
schatzungen hat, und wobei Verstarkungstabellenauswahlmittel zur automat isch en Auswahl einer Verstarkungs- 
tabelleaus dieser Vielzahl von Verstarkungstabellen als eine Funktion des gesamtdurchschnittlichen Hintergrund- 
rauschpegels dieses Eingangssignals vorhanden sind. 

8. RauschunterdrQckungssystem nach einem der vorhergehenden Anspruche, das waiter Mittel zur Kombinierung 
dieser Vielzahl nachverarbeiteter Signale enthatt, um dieses rauschunterdrOckte Ausgangssignal zu erzeugen. 

9. Rauschunterdruckungssystem nach einem der vorhergehenden Anspruche, wobei diese Mittel (220) zur Erzeu- 
gung der Abschatzungen enthalten: 

• Mittel zur E rzeugung und Speicherung einer Abschatzung der Hintergrundrauschleistungsspektraldichte die- 
ser von/erarbeiteten Signale, wobei diese hintergrundrauschabschatzungserzeugenden Mittel Mittel zur Mo- 
difizierung dieser Hinlergrundrauschabschatzung in Reaktion auf einen Zeitparameter enthalten, der eine An- 
zeigedes Zeitinten/alls seit der vorherigen HintergrundrauschabschatzungsrTK)difikation ist; und 

• Mittel zur Erzeugung einer Abschatzung des Signal-Rausch-Abstands (SNR) in jedem einzelnen Kanal auf 
der Grundlage dieser modifizierten Hintergrundrauschabschatzungen; 

wobei diese Mittel (310, 810) zur Erzeugung eines Verstarkungswerts fur jeden einzelnen Kanal jeden Verstar- 
kungswert in(Reaktion auf diese Kanal-SNR-Abschatzungen erzeugen. 

10. Rauschunterdruckungssystem nach Anspruch 9, wobei diese hintergrundrauschabschatzungsmodifizierenden 
Mittel Mittel zur Erzeugung dieses Zeitparameters und Mittel zum Vergleich dieses Zeitparameters mit einem vor- 
bestimmten Zeitgeberschwellenwert enthalten, so da8 eine IHIntergrundrauschabschatzungsmodifikation durch- 
gef uhrt wird, wenn dieser Zeitparameter diesen Zeitgeberschwellenwert uberschreitet. 

11. Rauschunterdruckungssystem nach Anspruch 10, wobei diese hintergrundrauschabschatzungsmodifizlerenden 
Mittel weiter Mittel zur Erzeugung einer Abschatzung der Energie in jedem einzelnen Kanal und Mittel zur Erzeu- 
gung eines Mehrkanalenergieparameters in Reaktbn auf den Gesamtwert aller einzelnen Kanalenergieabschat- 
zungen enthalten. 

12. Rauschunterdruckungssystem nach Anspruch 11, wobei diese mehrkanalenergieparametererzeugenden Mittel 
sich geringen Veranderungen der einzelnen Kanalenergieabschatzungen anpassen, so daB diese geringen Ver- 
anderungen diesen Mehrkanalenergieparameter nicht wesentlich beeinflussen. 
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13. RauschunterdrOckungssystem nach Anspruch 11 Oder 12, wobei diese hintergrundrauschabschatzungs- 
modifizierenden Mittel welter Mittel zum Vergleich dieses Mehrkanalenergieparameters mit einem vorbestimmien 
Energieschwellenwert enthatten, so daQ eino Hintergrundrauschabschatzungsmodifikation durchgefuhrt wird, 
wenn dieser Mehrkanalenergieparameter kleiner als dieser Energieschwellenwert ist. 

14. RauschunterdrOckungssystem nach Anspruch 13, wobei diese hintergrundrauschabschatzungsmodtfizierenden 
Mittel diese Hintergrundrauschabschatzungen in Reaktbn auf diesen Zeitparameter modifizieren. ungeachtet die- 
ses Mehrkanalenergieparameters. 

15. RauschunterdrOckungssystem nach einem der Anspruche 11 bis 14, wobei dieser Mehrkanalenergieparameter 
dadurch erzeugt wird, da6 diese einzelnen Kanal-SNR-Abschatzungen die einzelnen Kanalsprachma3e Oberset- 
zen, wobei die SprachmaRsumme eine Messung der gesamten sprachahn lichen Charakteristiken der Energie in 
alien Kanalen ist. 

16. RauschunterdrOckungssystem nach einem der vorhergehenden Anspruche, weiter umfassend: 

• Mittel (8 15, '820, 830) zur Oberwachung dieser Kanalenergieabschatzungen und zur Unterscheidung der 
Schmalbandrauschbundel von Sprachenergie und Hintergrundrauschenergie, wodurch ein Modlfikationssi- 
gnal (835) erzeugt wird; 

• Mittel (590) zur selektiven Modifizierung dieser Kanalenergieabschatzungen in Reaktbn auf dieses Modifika- 
tionssignal (835), so da8 Kanalenergieabschatzungen, die Schmalbandrauschbundel darstellen. modtfiziert 
werden; 

• Mittel zur Erzeugung eines Verstarkungswerts fur jeden einzelnen Kanal In Reaktion auf jede modifizierte 
Kanalenergieabschatzung; und 

• Mittel zur Modifizierung der Verstarkung jedes aus dieser Vielzahl vorverarbeiteter Signale In Reaktion auf 
diese Verstarkungswerte, um eine Vielzahl nachverarbeiteter Signale berettzustellen. 

17. RauschunterdrOckungssystem nach Anspruch 16, wobei dieses Modiflkationssignal eine Anzeige der Gesamtan- 
zahl der einzelnen Kanale Ist, die Energieabschatzungen haben, die einen vorbestimmten Energieschwellenwert 
uberschreiten. 

18. RauschunterdrOckungssystem nach Anspruch 16 oder 17, wobei diese kanalenergieabschatzungsmodi- 
flzlerenden Mittel Mittel (830) zum Vergleich dieses Modifikationssignals mit einem vorbestimmten Zahlerschwel- 
lenwert enthalten, so dafl eine Kanalenergieabschatzungsmodifikatlon durchgef Ohrt wird, wenn diese Gesamtan- 
zahl der einzelnen Kanale kleiner als dieser Zahlerschwellenwert Ist. 

19. RauschunterdrOckungssystem nach Anspruch 16, 17 oder 18, wobei diese verstarkungsmodifizierenden Mittel 
eInen maximalen Betrag der Dampfung des vorverarbeiteten Signals in einem bestimmten Kanal, der eine modi- 
fizierte Kanalenergieabschatzung hat, gewahrleisten. 

20. Verfahren zur Dampfung des Hintergrundrauschens eines verrauschten Eingangssignals (205), um in einem 
RauschunterdrOckungssystem (800) ein rauschunterdrucktes Ausgangssignal (265) zu erzeugen, wobei das Ver- 
fahren die folgenden Schritte umfaBt: 

• Aufteilung (850) des Eingangssignals in eine Vielzahl vorverarbeiteter Signale, die durch eine Anzahl von N 
ausgewahlten Frequenzkanalen dargestellt werden; 

• Erzeugung einer Abschatzung der Energie in jedem einzelnen Kanal (853); 

• Erzeugung und Speicherung eIner Abschatzung der Hintergrundrauschleistungsspektraldichte dieser vorver- 
arbeiteten Signale; und 

• Erzeugung einer Abschatzung des Signal-Rausch-Abstands (SNR) in jedem einzelnen Kanal auf der Grund- 
lage dieser Hintergrundrauschabschatzungen und dieser Kanalenergieabschatzungen; 

wobei das Verfahren durch die folgenden Schritte gekennzeichnet ist: 

• Erzeugung (861, 862) eines Verstarkungswerts fur jeden einzelnen Kanal in Reaktion auf diese Kanal-SNR- 
Abschatzungen, wobei diese Verstarkungswerte einen Bereich minimaler Werte haben; 

und wobei dieser verstarkungswerterzeugende Schritt die folgenden Schritte enthalt: 
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• Bereitstellung eines vordefinierten SNR-Schwellenwerts und Vergleich (902) dieser Kanal-SNR-Abschatzun- 
gen mit diesem vordefinierten SNR-Schwellenwert, so daB Kanale, die SNR-Abschatzungen unter diesem 
SNR-Schwellenwert haben, Verstarkungswerte innerhalb dieses minlmaten Bereichs erzeugen; und 

• Modrfizienjng (910) der Verstarkung jedes aus dieser Vielzah! vcn/erarbeiteter Signale in Reaktion auf diese 
Verstarkungswerte, urn eine Vielzaht nachverarbeiteter Signale bereitzustellen. 



Revendications 

1 . Syst6me d'6limination de bruit (800) permettant d'att6nuer le bruit de fond provenant d'un signal d'entrde bruyant 
(205) pour produire un signal de sortie dont le bruit a 6X6 supprime (265), tedit syst^me d'^limination de bruit 
comprenant : 

un moyen (210) permettant da s6parer le signal d'entr^e (205) en une plural ite de signaux prdtraitds repre- 
sentant les voies de frequence choisies ; 

un nrtoyen (220) permettant de g^n^rer des estimatbns de I'^nergie du signal plus bruit et de I'^nergie du bruit 
dans chaque voie individuelle ; et- 

un moyen (310, 810) permettant de produire une valeur de gain pour chaque voie individuelle en rdponse 
auxdites estimations d'energie de vole ; 

le syst^me (800) 6tant caract6ris6 en ce que : lesdites valeurs de gain pr^sentent une valeur de gain minimum 
pour chaque voie ; 

ledit moyen (310,810) permettant de produire une valeur de gain comporte un moyen formant seu il permettant 
^ des valeurs de gain sup^rieures k ladrte valeur de gain minimum d'dtre produites uniquement lorsque lesdites 
estimations d'energie de signal-plus-bruit ddpassent lesdites estimations d'dnergie de bruit d'une quantity 
pr^ddterminde ; et 

un moyen permettant de modifier le gain de chaque signal pr6trait6 de ladite plurality de signaux pr6trait6s 
en r6ponse auxdites valeurs de gain afin de foumir une plurality de signaux post-trait6s. 

2. Systems d'^limination de bruit selon la revendication 1 , dans lequel ledit moyen (31 0, 81 0) penmettant de produire 
une valeur de gain produit des valeurs de gain qui sont fonctlon du rapport signal/bruit (SNR) desdites estimations 
d'energie de voie, et dans lequel lesdites estimations de SNR sont compar^es k un seuil de SNR pr^dSfini de 
sorte que les voies prdsentant des estimations de SNR infdrieures audit seull de SNR produisent des valeurs de 
gain minimum. 

3. Syst6me d'6llmination de bruit selon la revendication 2. dans lequel ledit seuil de SNR pr6d6flni correspond ^ une 
valeur de SNR comprise entre 1 ,5 dB et 5 dB. 

4. Syst^me d'^limination de bruit selon la revendication 3. dans lequel ledit seuil de SNR pred^fini correspond k une 
valeur de SNR d'environ 2,25 dB. 

5. Syst^me d'^limination de bruit selon I'une quelconque des revendications precSdentes, dans lequel ledit moyen 
de modification de gain foumit une quantity maximum d'atl6nuation du signal pr6-propos6 dans une voie particu- 
li^re ayant une valeur de gain minimum. 

6. Systems d'6liminatlon de bruit selon I'une quelconque des revendications pr6c6dentes, dans lequel les valeurs 
de gain produisent une quantity supSrieure d'att6nuation pour les voies k frequence 6lev6e que pour les voies k 
laible frequence. 

7. Systfeme d'6limination de bruit selon la revendication 1 , dans lequel ledit moyen (310, 810) permettant de produire 
une valeur de gain comporte en outre une plurality de tables de gain, chaque table de gain pr6sentant une valeur 
de gain de voie individuelle pr^ddtermin^e correspondant auxdites estimations d'energie de voie individuelle, et 
un moyen de selection de tables de gain permettant de choisir automattquement une table de gain parmi la plurality 
de tables de gain en fonctlon de niveau de bruit de fond global nrKiyen dudit signal d'entr^e. 

8. Syst^me d'elimination de bruit selon I'une quelconque des revendications prdc^dentes, comportant en outre un 
moyen permettant de combiner ladite pluralit6 de signaux post-trait6s pour produire ledit signal de sortie dont on 
a supprim6 le bruit. 
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9. Syst6me d'6Iimination de bruit selon Tune quelconque des revendications pr6c6dentes dans lequel ledrl moyen 
(220) permettant de g6n6rer des estimations comporte : 

un moyen permettant de g^n^rer et de stocker une estimation de la density spectrale de puissance du bruit 
5 de fond desdits signaux pr6trait6s. ledit moyen de g6n6ration d'estimation de bruit de fond comportant un 

moyen permettant de modifier ladite estimation de bruit de fond en r^ponse ^ un paramdtre tempore! indiquant 
i'intervalle de temps s'^tant 6cou\§ depuis la modification de Test Imat Ion de bruit de fond pr^^dente ; et 
un moyen permettant de g^n^rer une estimation du rapport signal/bruit (SNR) dans chaque vote Individuelle 
en fonction desdites estimations de bruit de fond modift^es ; 
10 dans lequel ledit moyen (310, 810) pemnettant de produire une valeur de gain pour chaque vote individuelle 

produit chaque valeur de gain en r^ponse auxdites estimations de SNR de voie. 

10. Syst6me d'6limination de bruit selon la revendication 9, dans lequel ledit moyen de nrKxJificatlon de I'estimation de 
bruit de fond comporte un moyen permettant de produire ledit paramelre temporel, el un moyen permettant de 

15 comparer ledit paramdtre temporel k un seuil temporel predetermine de sorte qu'une modification des estimations 

de bruit de fond est effectu6e lorsque ledit param^tre temporel d^passe ledit seuil temporel. 

11. Syst^me d'6limination de bruit selon la revendication 10, dans lequel ledit moyen de modification de I'estimalion 
de bruit de fond comporte en outre un moyen penmettant de gdn^rer une estimation de I'^nergie dans chaque voie 

20 individuelle, et un moyen permettant de produire un param6tre d'6nergie multivoie en r6ponse k la valeur total e 

de toutes les estimations d'6nergie de voie individuelle. 

12. Syst^me d'6limination de bruit selon la revendication 11, dans lequel ledit moyen de production de param^tre 
d'^nergie multivoie permet des variations mineures des estimations d'dnergie de voie individuelle de sorte que 

25 lesdites variations mineures n'affectent pas de fa^on considerable ledit paramdtre d'^nergie multivoie. 

13. Syst6me d'6limination de bruit selon la revendication 11 ou 12, dans lequel ledit moyen de modification des esti- 
mations de bruit de fond comporte en outre un moyen permettant de comparer ledit parametre d'energie multivoie 
k un seuil d'energie predetermine de sorte qu'une modification des estimations de bruit de fond est effectuee 

30 lorsque ledit parametre d'energie multivoie est inferieur audit seuil d'energie. 

14. Systems d'eiimination de bruit selon la revendication 13. dans lequel ledit moyen de modificatbn des estimations 
de bruit de fond rrxxiifie lesdites estimations de bruit de fond en reponse audit parametre temporel quel que soit 
ledit parametre d'energie multivoie. 

35 

15. Systeme ^elimination de bruit selon I'une quelconque des revendications 11 ^ 14, dans lequel ledit parametre 
d'energie multivoie est genere en traduisant lesdites estimations de SNR de voie individuelle en les metriques de 
parole de voie individuelle, la somme des metriques de parole etant une mesure de toutes les caracteristiques 
ressemblant k de la parole de renergie dans toutes les voies. 

40 

16. Systeme d'eiimination de bruit selon I'une quelconque des revendications precedentes, comprenant en outre : 

un moyen (815, 820, 830) permettant de surveiller lesdites estimations d'energie de voie et de distinguer des 
salves de bruit k bande etrolte de renergie de parole et de I'energie de bruit de fond, ce qui permet de produire 
45 un signal de modification (835) ; 

un moyen (590) penmettant de modifier de fa^on selective lesdites estimations d'energie de voie en reponse 
audit signal de modification (835) de sorte que les estimations d'energie de voie representant des salves de 
bruit k bande etroite soient modifiees ; 

un moyen permettant de produire une valeur de gain pour chaque voie individuelle en reponse k chaque 
50 estimation d'energie de voie modifiee ; et 

un moyen permettant de modifier le gain de chaque signal pretraite de ladite pluralite de signaux pretraites 
en reponse auxdites valeurs de gain pour foumir une pluralite de signaux post-traites. 

17. Systeme d'eiimination de bruit selon la revendication 1 6. dans lequel ledit signal de modification indique le nombre 
55 total de voles individuelles presentant des estimations d'energie depassant un seuil d'energie predetermine. 

18. Systeme d'eiimination de bruit selon la revendication 16 ou 17, dans lequel ledit nrroyen de modification des esti- 
mations d'energie de voie comporte un moyen (830) permettant de comparer ledit signal de modification ^ un seuil 
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de compte pr6cl6termin6 de sorle qu'une modification des estimations d'6nergie de voie est effect u6e lorsque led it 
nombre total de voies indivlduelles est inf^rleur audit seuil de compte. 

19. Syst6me d'6limination de bruit selon la revendication 16, 17 ou IB, dans lequel ledil moyen de modificatbn de 
s gain foumit une quantity maximum d'att^nuation du signal pr6trait6 dans une voie particuli^re ayant une estimation 

d'dnergie de vole modifl^e. 

20. Proc6d6 d'att6nuation du bruit de fond provenant d'un signal d'entr6e bruyant (205) pour produire un signal de 
sortie dont on a supprim6 le bruit (265) dans un systfeme d'6limination de bruit (800) comprenant les 6tapes con- 
to sistant k : 

s6parer (650) le signal d'entr6e en une plurality de signaux pr6trait6s repr6sentant un nombre N de voies de 
frequence choisies ; 

g^ndrer une estimation de I'dnergie dans chaque voie Individuelle (853) ; 
IS g^n^rer et stocker une estimation de la density spectrale de la puissance du bruit de fond desdrts signaux 

pr^trait^s ; et 

g6n6rer une estimation du rapport signalA^ruit (SNR) dans chaque voie individuelle en fonction desdites es- 
timations de bruit de fond et desdites estimations d'6nergie de vole ; 

20 le proc6d6 6tant caract6ris6 par les 6tapes consistent ^ : 

produire (861 , 862) une valeur de gain pour chaque voie individuelle en r6ponse auxdites estinnattons de SNR 
de voie, dans lequel lesdites valeurs de gain pr6sentent un 6ventail de valeurs minimales ; 

25 et dans lequel ladite 6tape de production des valeurs de gain comporte les 6tapes consistant k : 

foumir un seuil de SNR pr6d6fini et comparer (902) lesdites estimations de SNR de voie audit seuil de SNR 
prdddfini de sorte que les voies ayant des estimations de SNR infdrleures audit seuil de SNR produisent des 
valeurs de gain contenues dans ledit dventail minimal ; et 
30 modifier (910) le gain de chaque signal pr6trait6 de ladite plurality de signaux pr^trait^s en r^ponse auxdites 

valeurs de gain afin de fournir une plurality de signaux post-trait6s. 
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INDEX(C€) FOR CHANNEL CO 



86h 
862^ 



INDEX VOICE METRIC TABLE CC 



OBTAIN VOICE METRIC VALUE 
VM(CC) FOR CHANNa CC 



863^ 



850 



1^ 



860 




870 
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FIG.4B 
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OBTAIN BACKGROUND NOISE 
ESTIUATE BNE(CC} 



Jl 



■913 



914 




SaECT GAIN TABLE SET No.1 




SELECT GAIN TABLE SET No.2 



921 



1 



SELECT GAIN TABLE SET No.3 



931- 



SET CC = 1 



F IQ.4E 



910 



-930 
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1 



932 



INDEX SELECTED GAIN 
TABLE PER INDEX(CC) 



T 



'933 



OBTAIN RAW GAIN VALUE 
R6(CC) FOR CHANNEL CC 



STORE RG(CC) K"* 



CURRENT BNErCC) = OLD BNEfCC) + 
SF[NEW BNE(CC) - OLD BNERCC}] 



TO 629 (RESET SAUPLE) 



930 




940 




FIG.4F 
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601 



INITIALIZATION 



602-Nl 



INCREyENT SAMPLE 



603^ 



604 



A. 



± 



INPUT SAMPLE 



605- 
606-> 



PRE-EUPHASIS I 
SET CHANNa = I I 




609-^ 
610 



SyOOTH GAIN I 

uooiFY SFCcnmi I 




FIG. 5 A 
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FIG. SB 




DICREIOT FRAVE 



-629 



CALCULATE INDEX I 



619 



LOOK UP GAIN 



622- 



623 



RESET SAMPLE | 



SIMULATE POST- 
PROCESSED ENERGY 



625 




UPDATE BACKGROUND 
NOISE ESnMATE 



621 
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FROM 603 



PRE-EUPHASIZE 
INPUT SPEECH SAUPLE 



CC = 1. TOTAL = 0 



702 




FILTER SAUPLE WHH 
BANDPASS FILTER CC 



705^ 



FUU-IAVE RECTIFY 
CHANNEL SAMPLE 



706- 



707- 



LOWPASS FILTER 
CHANNa SAUPLE 



STORE CHANNa ESHMATE 



706 



SMOOTH RAW GAIN VALUE 
FOR CHANNEL CC 



709- 



HULTMY FILTER SAMPLE 
BY SMOOTHED CAIN VALUE 
FOR CHANNa CC 



7/0 



ADO CHANNa (CC) SAMPLE 
TO OUTPUT SAMPLE TOTAL 



77/ 



CC = CC + 1 



712 



OE-EUPHASIZE 
OUTPUT SPEECH SAUPLE 

^ 

TO 613 



FIG. 6 A 
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FROy 614 

cc = 1 I 




723 



CALCULATE SNR 
FOR CHANNa CC 



724- 



I 



CHOOSE GAIN TABLE 



I 



INDEX GAIN TABLE TO 
OBTAIN RAW GAIN VALUE 



725 



± 



STORE RAW GAIN VALUE 
FOR CHANNEL CC 



726- 



CC = CC + 1 



TO 621 



FIG.6B 



31 

03/28/2003, EAST Version: 1.03.0002 



730 



EP 0 380 563 B1 



FROM 615 



SET CHANNa 
COUHT(CC) TO 1 




SIM POST-PROCCSSEO ENERGY FOR 
CHANNa CC = RAW GAIN VALUE 
FOR CHANNa CC X PRE-PROCESSED 
ENERGY ESTIMATE FOR CHANNa CC 



733 



INCREMENT CC 



SET CHANNa 
COUNT(CC) TO 1 



± 



^734 



INinALIZE POST- 
PROCESSED 
ENERGY TO ZERO 



735 




•»T0 740 



ENERGY = ENERGY + SIM 
POST-PROCESSED ENERGY 
OF CHANNa CC 



738 



T 



INCREMENT CC 



J FIG.6C 
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740 



FROM 736 



COMPUTE LOG OF COMBINED 
POST-PROCESSED 
CHANNa ENERGY 



742 




DECREASE VALLEY 
40 MS TC 



UPDATE NOISE 
SPECTRAL ESTIMATE 



1 



TO 629 



743 



INCREASE VALLEY 
1 SEC TC 




F IG.6D 
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